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ABSTRACT 

Various parallel forms of the NTE (National Teacher 
Examinations) Test of Professional Knowledge (NTETPK) have been 
administered to examinees aspiring to initial teacher certification 
since 1982. To confirm that recent reported scores are comparable to 
those from earlier forms, the stability of the score scale was 
studied using section pre-equating methodology (SPE). THE NTETPK 
consists of four 35-item sections with parallel content. Three 
sections are operational, and one section is non-operational. Three 
sections of a 1983 test form (Form F-4) were separately administered 
(non-operationally) in the fall of 1988 to randomly selected thirds 
of the testing population along with another entire operational form 
(Form K2I). Each of the three sections was given to approximately 
7,500 examinees. Using SPE, maximum likelihood mean and standard 
deviation and variance/covariance estimates were venerated for the 
"newly constituted" F4 form, which was then equated to Form K2I. 
Comparison of the "new" and old conversion tables revealed 
differences ranging from -0.04 to -0.21 points. For all practical 
purposes, these differences are not significant. Within the range 
scored by 90% of the examinees, differences of one point occurred at 
only 8 out of 56 possible scores. Based on these results, it appears 
that there has been virtually no drift in the NTETPK during the 
period 1983 through 1988. A 12-item list of references is included. 
There are five tables of statistical data and three figures 
illustrating the study. (SLD) 
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Abstract 

The NTE Test of Professional Knowledge is administered annually to over 85,000 examinees 
>^o aspire to receivt initial teacher certification. Various parallel forms of the test have been 
administered since 1982 and scores are reported on a common scale. To confirm that reported 
scores on recent forms of the test are comparable to those from earlier forms, a study of the 
stability of the score scale was conducted utilizing the methodology of Section Pre-Equating (SPE). 

The NTE Test of Professional Knowledge consists of four 35-item sections with p arallel 
content. Three sections are operational; one is non-operational. The total raw score is the sum of 
the operational section raw scores. One or more new forms are developed every year, each form 
containing from 20 to 35 items in common with a previous form. For this study, three sections of 
a 1983 test form (constituting the entire operational form - Form F4) were separately re- 
administered non-operationally in the Fall of 1988 to random thirds of the testing population, along 
with another entire operational form (Form K2I). Using SPE, maximum likelihood mean and 
standard deviation and variance/covariance estimates were generated for the "newly-constituted" 
F4-form. The "new" F4-form was then equated to Form K2I. 

Comparison of the "new" and old conversion tables revealed differences ranging from -0.04 
to -0.21 points. For all practical purposes, these differences are not significant, especially across five 
intervening equatings and with SEMs of 4.1 and 3.7 for the F4-form and the K2I-form, respectively. 
The rounded integer scores, reported to examinees, differ by no more than 1 score point. These 
larger differences are simply an artifact of the integer rounding. In the reported score range 
between 630 and 661 (the minimum and maximum state qualifying scores), non-zero integer 
differences were found only at four scores. Within the range scored by 90% of examinees, 
differences of one point occurred at only eight scores. Based on these results, it appears that there 
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was virtually no drift in the NTE Test of Professional Knowledge scale during the five year period 
1983 through 1988. 
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Scale Stability of the 
N'lE Core Batteiy Test of Professional Knowledge' 

Standard 4.9 of the APA/AERA/NCME Standards for Educational and Psychological 
Testing (1985) recommends that "continuing testing programs that attempt to maintain a common 
scale over time...conduct periodic checks of the stabilify' of the scale" (page 34). Likewise, the £3S 
Standards for Quality and Fairness (Educational Testing Service, 1987) require ETS testing 
programs to periodically evaluate the stability of the score scale being used. 

The 1986 and 1989 Audit Reports of NTE Programs indicate that a decision was made to 
study score scale stability when the Core Batteiy had been administered for five years. This study 
concerns itself with only the Test of Professional Knowledge and one particular study methodology, 
[Section Pre-Equating (SPE)]. A plan to study scale stability in the remaining Core Battery tests 
will be developed at a later date. 

One method to study scale stability is to administer a given test form at widely separated 
times and equate the form to itself through multiple links. Comparison of the resulting tv'o raw-to- 
scale conversion tables will give an indication of any scale drift or instability. For scales without any 
drift, the conversion tables would be virtually identical. Any differences could be attributed to non- 
equivalence of the testing populations (or samples), to measurement error, or to equating error 
(Brennan & Kolen, 1986). 

The scale stability of several tests developed at ETS has been investigated over the years. 
Stewart (1966) studied the stability of the SAT- Verbal score scale over a fifteen year period using 
Tucker and Levine linear equating and equipercentile nonlinear equating, with conrimon item 



'Paper presented at the annual meeting of the Northeastern Educational Research Association 
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anchors. She found relative^ small changes around the mean between 1953 and 1963 scores, and 
between 1957 and 1963 scores; changes of 20-35 points (on a scale of 200-800) were found between 
1948 and 1963 scores. Using similar equating methods, Modu and Stem (1975) report finding 
average differences of 14 and 17 points for the SAT-Verbal and SAT-Mathematical scores, 
respective^, when comparing conversion tables for 1963 and 1966 with those of 1973. Modu and 
Stem (1977) confirmed differences of 8-10 points on the SAT- Verbal Scale between 1963 and 1973 
score scales, using an alternative equating method. 

Angell and Schneider (1989) investigated the scale stability of the Test of English for 
International Communication (TOEIC) between 1984 and 1988 with 4 and 5 intervening equatings. 
Using an internal anchor test nonequivalent groups data collection design, Tucker and Levine 
equating models, and the bisection of double-part equating, they found differences of over 16.3 
points on the TOEIC Listening Comprehension scale which ranges from 5 to 450, and less than 3 
points on the TOEIC Reading Comprehension scale (also 5 to 450). 

To my knowledge, no stability studies have been published using the Section Pre-equating 
methodology adopted in the current study. 
Description of the Test 

The NTH Core Battery consists of the three separate tests: Test of Communication Skills, 
Test of General Knowledge, and Test of Professional Knowledge. These tests were first 
administered in November 1982. The Test of Professional Knowledge consists of four 35-item 
sections with parallel content, each separately timed for 30 minutes. Three sections are operational 
(count toward an examinee's score) and one is non-operational (not included in an examinee's 
score). Generally, the non-operational section is used for pretest puq oses, albeit not for this study, 
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as described below. The total raw score is the sum of the operational section raw scores, the 
number of right answers on the items. 

One or more new forms are developed every year; each form containing from 20 to 35 items 
in common with a previous form. Common item linear equating (using internal anchors, 
nonequivalent groups, and Tucker and Levine equating models) is typically used to place the raw 
total scores on the reported score scale. Scores are reported on a scale of 600 to 695, in one point 
intervals. 

Insert Figure 1 about here 

Figure 1 shows the linkage between form SENT and subsequent forms important to this 
study. Form SENT was first administered in November 1982 and contained 3 operational sections 
(Ei.j) with 104 scored items, and 1 non-operational section. Subsequent forms 3FNT1 and 3FNT2 
had 35 items and 34 items, respectively, in common with 3ENT, and 33 items in common with each 
other. Forms 3FNT3 d 3FNT4 were first administered in late 1983; subsequently, forms 3GNT, 
3HNT and 3INT were introduced at yearly intervals thereafter. Forms K-3INT (form 3INT with 
several revised items) and K2-3INT (form K-3INT slightly revised) were introduced in 1988. 
Design of the Scale Drift Study 

Scale stability can be inves;>5ated administering a given test form at widely separated 
times and then equating the newly administered form to a recent form that is on scale, resulting in 
a "new" conversion table for that old form. Comparison of the resulting two raw-to-scale conversion 
tables (original and "new") will give an indication of any scale drift or instability. For scales without 
any drift, the conversion tables would be virtually equivalent. 
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To facilitate this scale stability study, Test Development staff assembled three versions of 

form K2-3INT, each of which contained one of the three different operational sections from form 

3FNT4 as a non-operational section 4: 

K2-3INT, contained section 1 of 3FNT4, 
K2-3INT2 contained section 2 of 3FNT4, and 
K2-3INT3 contained section 3 of 3FNT4. 

That is, sections 1, 2, and 3 of 3FNT4 were operational when that form was administered as an 

intact form in 1983, but were considered non-operational ^en administered as section 4 in 

K2-3INT,.3. 

Form 3FNT4 (the F4-form) was first administered in October 1983 to 4,516 examinees. Two 
items were PINned^ at that time. Each of the three K2-3INT forms (I-forms) was given in October 
1988 to approximately 7,500 examinees. Hie forms were distributed in a spiralled arrangement at 
each test center. Section Pre-Equating methodology was then used to statistically combine these 
F4 sections and equate the combination ("new" 3FNT4 form) to the K2-3INT form. The "new" 
conversion table that results from this procedure was then compared to the original 3FNT4 
conversion table. 

The original conversion parameters for form 3FNT4 were obtained by averaging the results 
of the following three equatings: 

1) total raw scores on 3FNT4 were equated, using Tucker equating, through scores on 33 
common items to scores on 3FNT1; 

2) total raw scores on 3FNT4 were equated, using Tucker equating, through scores on 68 
common items to scores on 3FTN2; 



* Items 10 and 16 in section 1. Form K2-3INT, also contained the two PINned items from 
3FNT4. These items were designated DNS items on both forms. 
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3) total raw scores on 3FNT4 were equated by setting means and standard deviations equal to 
scores on form 1)FNT3 (which was spiralled with 3FNT4) and averaging the result with the 
bisector of the two Tucker equatings. 

Methodology 

The technique of Section Pre-Equating (SPE) is ideally suited for the situation in which 
random groups of examinee.s take only portions of a complete test form, the whole of which must 
be equated to a form all examinees have taken (Holland & Wightman, 1982; Petersen, Kolen & 
Hoover, 1989; Wightman & Wightman, 1988). Typically, SPE is used when a section (or sections) 
of non-operational items is included within a complete (operational) test form; several versions of 
the form are developed, differing only in their non-operational section(s). These versions are then 
spiralled among examinees. Following the administration, it is possible to use the resulting 
combination of complete information on the operational items and incomplete information on the 
non-operational items to estimate the parameters of the desired equating function for the new test 
form. In this study, examinees took all three I-form sections and only one of three possible F4-form 
sections, as illustrated below. 



Sections 



Subform 


I. 




I3 


F4, 


F4j F4. 


K2-3INT, 


X 


X 


X 


X 




K2-3INTj 


X 


X 


X 




X 


K2-3INT3 


X 


X 


X 




X 



That is, form K2-3INTi contained sections I„ Ij, la, and F4,. This is the one-variable section pre- 
equating design as described in detail by Holland and Wightman (1982) and by Wightman and 
Wightman (1988). 
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The goal is to equate the raw scores from the three non-operational F4-fonn sections 
(combined across sections) to the raw scores based on all the I-form items by the use of SPE. 
Linear equating can proceed once we have the observed means and standard deviations for the 
I-form and the maximum likelihood estimated mean and standard deviation and variance/covariance 
estimates for the F4-form sections on that form. 

Results 

Conversions 

The means and standard deviations for the three 3FNT4 sections at both administrations 
are reported in Table 1. Means and standard deviations of the I-form sections are also reported. 
The 1988 group had higher means (about .9 points) and smaller standard deviations for all three 
F4 sections compared to the 1983 group of examinees. Note that the I-form sections appear to be 
slightly easier than the F4-form sections. The standard error of measurement (SEM) for the 
original F4-fonn was 4.1; for the I-form, 3.7. 

Insert Table 1 about here 

The covariances and correlations between sections and ths section means are listed in Table 
2. Because a different one/third of the examinees took each of the F4 sections, values shown in 
the table involving the F-form sections are maximum likelihood estimates calculated through the 
SPE process. 

Insert Tables 2 and Table 3 about here 
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The parameters for converting the original F4*form raw scores to scale were A«,^ « .9379 
and B,„^ » 594.4079. The SPE analysis produced new estimates of the conversion parameters, 
= .9363 and = 594.3650. 

Table 3 details the differences between the original and new conversion lines. Column 1 
shows the possible total raw scores from 0 to 103^ Column 2 contains the p>ercentage of examinees 
scoring below a particular raw score (based on the original F4-form administered in 1983); this 
provides an indication of the distribution of scores. Columns 3 through 5 show the original 
converted scaled scores, the newly converted scaled scores and their differences, respectively. 
Differences in unrounded scaled scores ranged from -0.04 to -0.21. The last columns show the 
scaled scores rounded to integers (as they are reported to examinees^) and the 12 score points out 
of 91 for which the new score values differed by one point from the original scale value. Plots of 
the two conversion lines are shown in Figures 2 and 3. 

Almost ninety percent of the examinees score between 620 and 675 (using the original 
conversions); within this range, the rounded differences are not zero at 8 points. Currently, 22 
states stipulate a qualifying scor,; on the NTE Test of Professional Knowledge as part of their 
certification requirements. The qualifying scores range from a minimum of 630 to a maximum of 
661. Within this critical range, the original rounded converted scores were identical to the "nev** 
rounded converted scores except for four score points, for which the original values exceeded the 
new values by just one point. 



^ The total raw score is the sum of raw scores from the three sections of 35 items each minus 
two PINned items. 

* Converted scores below 600 are reported as 600. 
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Insert Table 4 about here 

Scaled score means, standard deviations, and conversion parameters for the various forms 
are shown in Table 4. When the original and new conversion parameters for 3FNT4 are used with 
both examinee groups, the difference in scaled score means between conversions is 0.26 for 1988 
examinees and -0.15 for 1983 examinees. 
Standard Error of Equating 

The standard error of equating (SEE) expresses the amount of error in test form equating 
that is due to using examinees from a particular administration date. For a given score on one form 
of the test, the error in estimating its equated score on another form is indexed by a standard error. 
la order to better evaluate the difference between the two conversions, SEEs for an illustrative 
equating were calcilated. The SEE formula for SPE has not yet been developed; however, one SEE 
formula for linear equating that may be appropriate lor our situation is from Lord [see Angoff 
(1984), pg. 106]: 



SEE = 



2a/(l-r2)[ —5 — ] 



This particular formula assumes that (1) the distributions of examinee scores are normal, 
(2) both examinee groups are random samples from the same population, and (3) a set of common 
items are administered to each examinee group. Most equatings for the Test of Professional 
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Knowledge follow these assumptions, so that calculating this set of SEEs will provide preliminary 
information to help evaluate the conversions. For these calculations, the value of r used was .79^. 

Insert Table 5 about here 

The standard errors of equating shown in Table 5 range from .72 to .15 for the mean scaled score 
of 658.61, when N= 10,000; they range from .51 to .11 for N=20,000. 

Discussion 

Comparison of the new converted scores with the original scores, in Fable 3, shows scaled 
score differences of -0.04 to -0.21. For all practical purposes, these differences are not significant, 
especially across five intervening equatings and with SEMs of 4.1 and 3.7 for the F4-form and the 
I-form, respectively. 

The rounded integer scores, reported to examinees, differ by no more than 1 score point. 
These larger differences are simply an artifact of the integer rounding: scores with fractions just 
below .5 rounded down, while those with fractions of .5 or just above, rounded up. In the reported 
score range between 630 and 661 (the minimum and maximum state qualifying scores), non-zero 
differences were found only at scores 637, 638, 652, and 653. Ninety percent of examinees score 
between 620 and 675 (using the original conversions); within this range, the reported score 
differences of one point occurred only at eight out of 56 possible scores. 

If the two sets of conversion parameters are indeed equivalent, then they should obtain 
similar results when applied to a particular set of examinee data. As shown in Table 4, when using 
the original and "new" sets of conversion parameters with the 1983 examinee data, the difference 



' The correlation between scores on the common item set and the non-common item set was 
.7904 for Form 3HNT and .7934 for Form 3INT. 
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in scaled score means is 0.26; for 1988 examinee data, the difference is -0.15. That is, if the "new" 
conversion parameters were used with the 1983 data, the unrounded scores would be .26 points 
higher, on average, than originally determined. 

Conclusion 

Based on these results, it appears that there was virtually no drift in the Test of Professional 
Knowledge scale during the five year period 1983 through 1988. 
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Table 1. Means and Standard Deviations for the Operational 
Sections from Original and Latest Administrations 
NTE Test of Professional Knowledge 



1 


Original 
F4-Form 
1983 Admin. 


I-Form 
1988 Admin. 


New 
F4-Form 
1988 Admin. 


1 1" Operational Section 


21.75 
4.65 


23.93 
4.74 


22.62 
4.62 
N= 7,487 


1 2"^ Operational Section 


22.69 
5.28 


23.98 
5.66 


23.43 
5.18 
N= 7,442 


1 3*^ Operational Section 


21.59 
5.29 


23.23 
5.15 


22.59 
5.22 
N= 7,462 


Total Operational Score 


66.04 
13.88 
N= 4,516 


71.15 
14.25 
N=22,391 


68.62* 
13.21* 
N= 22,391 



' Maximum likelihood estimates from SPE analysis. 

E-\DXA\DRlFr\DRlFn.NEW 



Table 2. Maximum Likelihood Estimates of Covariance, Correlations, 
and Means* for NTE Test of Professional Knowledge, forms K2-3INTj^* 







I-form 




F4-form 




Section 1 


Section 2 


Section 3 


Section 1 


Section 2 


Section 3 


M 


22.502 


0.756 


0.742 


0.697 


0.727 


0.744 


1-2 


20.283 


32.014 


0.777 


0.733 


0.758 


0.775 


1-3 


18.126 


22.648 


26.530 


0.711 


0.739 


0.776 


F4-1 


15.179 


19.049 


16.821 


21.071 


0.636 


0.656 


F4-2 


17.823 


22.166 


19.697 


15.092 


26.741 


0.680 


F4-3 


18.611 


23.129 


21.074 


15.866 


18.540 


27.803 


Means 


23.929 


23.984 


23.234 


22.645 


23.409 


22.563 



' Covariances are below the diagonal; correlations are above the diagonal; means are the bottom 
row. 

* Statistics involving only operational sections on the I-form are actual values; those involving 
sections on the F-form are maximum likelihood estimates from the SPE analysis. 
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Table 3. Differences between Scaled Scores 
Based on ths Original and New Conversion Parameters 
NTE Test of Professional Knowledge, form 3FNT4 





UNROUND 




REPORTED 


Raw 


% 


Onginal 


New 


IxvSlUUfU 


ungmat 


new 




Score 


Below 


Scale 


Scale 




Scale 


Scale 




0 


U.U 


K€%A At 

5y4.41 


394.37 


-0.04 


600 




0 


1 
X 


U.U 




373.3U 


-0.04 


600 


600 


0 


L 


U.U 


<Q< 7Q 


^06 74 


-0.05 


600 


600 


0 


D 


U.U 


<07 77 
37 / .ZZ 


^07 17 


-0.05 


600 


600 


0 


A 


U.U 


370.10 


^QA 11 
370.X1 


-0.05 


600 


600 


0 


1 c 


0.0 


cnn iA 


<on A< 


-0.05 


600 


600 


0 


II 6 
II 


nn 




599 98 


-0.05 


600 


600 


0 




nn 

U.U 




60092 


■O.05 


601 


601 


0 


li SI 
II 


U.U 


6ni Qi 


601 86 


-0.06 


602 


602 


0 


0 


n n 

U.U 


6A7 fl^ 

UUZ.03 


607 70 


-0.06 


603 


603 


0 


10 


0.0 


603.79 


603.73 


-006 


OU4 


604 


n 
u 


11 


U.U 


Dtrt. 


604 66 


-006 


605 


605 


0 


IZ 


U.U 


OUj.OO 


0U3.0U 


-0.06 


606 


606 


0 


13 


U.U 


ouo.ou 


606 ^4 


-0.06 


607 


607 


0 


1/1 

IH 


U.U 


6n7 ^ 


607 47 

OU / .t / 


-0.07 


nan 

QUO 


607 


-1 


15 


0.0 


£.i\0 AO 

608.48 


o0o.41 


-0.07 


OUo 


608 

OUO 


n 




nn 


609 41 


609.35 


-0.07 


609 


609 


0 


17 
1 / 


nn 

U.U 


61 n 


610 28 


-0.07 


610 


610 


0 


Ifi 
lo 


n n 

U.U 


All 70 

oii.zy 


611 77 


-0.07 


611 


611 


0 


io 

ly 


n n 

U.U 


A17 71 
01Z.Z3 


617 1^ 


-0.07 


612 


612 


0 


20 


0.1 


613.17 


613.09 


-0.07 


613 


All 

ol3 


A 

u 


oi 

Zl 


n 1 
U.l 


A14 in 

014.1U 


614 n'l 

01H.U3 


-008 


614 


614 


0 


22 


A 1 
U.l 


Ai^ rL4 
013 .U4 


A14 OA 
01*f.V0 


-0.08 


61S 


615 


0 




n 9 
u.z 


A1< Ofi 
013.VO 


A1 ^ 00 
013 .tU 


-0.08 


616 


616 


0 




n 7 
u.z 


616 Q7 


616 84 


-0.08 


617 


617 


0 


25 


A 1 

0.3 


ol7.8o 


01 // 


-0.08 


618 
OlO 


618 


n 


26 


03 


618.79 


618.71 


-0.08 


619 


619 


0 


97 
^ f 


n s 


619 7*^ 


619 65 


-0.09 


620 


620 


0 




n7 


620 67 


620.58 


-0.09 


621 


621 


0 


29 


0.8 


621.61 


621.52 


-0.09 


622 


622 


0 


30 


0.8 


622.54 


622.45 


-0.09 


623 


622 


-1 


31 


1.0 


623.48 


623.39 


-0.09 


623 


623 


0 


32 


1.4 


624.42 


624.33 


-0.09 


624 


624 


0 



E:\DXA\DRI Fr\DRIFn.NEW 






UNROUND 




REPORTED 


H ivaW 


VO 


V/ri(JlIial 




l\CSluUal 




New 


Diff, 


1 Score 


Below 


Scale 


Scale 




Scale 


Scale 








625 36 


625 26 


■0.10 


625 


625 


0 


34 


1.8 


626.30 


626.20 


■0.10 


626 


626 


0 








/ •X*ff 


■0.10 


627 


627 


0 


36 


2.3 


628.17 


628.07 


■0.10 


628 


628 


0 


37 


2.7 


629.11 


629.01 


■0.10 


629 


629 


0 


DO 






62994 


■0.10 


630 


630 


0 


39 


38 


630.99 


630.88 


■0.11 


631 


631 


0 


•tu 








■0.11 


632 


632 


0 


41 


4.9 


632.86 


632.75 


■0.11 


633 


633 


0 


42 


5.5 


63380 


633.69 


■0.11 


634 


634 


0 


43 


6.2 


634.74 


634.63 


■0.11 


635 


635 


0 


44 


7.0 


635.68 


635.56 


■0.11 


636 


636 


0 


*tj 


7 (\ 




636 SO 


■0.11 






-1 


46 


8.6 


637.55 


637.43 


■0.12 


638 


637 


-1 


47 


9.4 


638.49 


638.37 


■0.12 


638 


638 


0 


•to 


10 4 


63Q 43 


639 31 


■0.12 


639 


639 


0 


49 


118 


640 37 


640.24 


■0.12 


640 


640 


0 




1*^ 1 




fA^ 18 


■0.12 


641 


641 


0 


51 


14.4 


642.24 


642.12 


■0.12 


642 


642 


0 


52 


15.8 


643.18 


643.05 


■0.13 


643 


643 


0 






64412 


64399 


■0.13 


644 


644 


0 


54 


18.8 


645.05 


644.93 


■0.13 


645 


645 


0 






645 QQ 


645 86 


■0.13 


646 


646 


0 


56 


22A 


646.93 


646.80 


■0.13 


647 


647 


0 


57 


24.0 


647.87 


647.73 


-0.13 


648 


648 


0 




26 1 


648 81 


648 67 


■0.14 


649 


649 


0 


59 


28.1 


649.74 


649.61 


■0.14 


650 


650 


0 


DU 








■0.14 


651 


651 


0 


61 


32.5 


651.62 


651.48 


-0.14 


652 


651 


-1 


62 


35.2 


652.56 


652.42 


■0.14 


653 


652 

W W 


-1 




37 S 


653 50 


65335 


■0.14 


653 


653 


0 


64 


40.0 


654.43 


654.29 


•0.15 


654 


654 


0 


65 


42.0 


655.37 


655.22 


■0.15 


655 


655 


0 


66 


44.8 


656.31 


656.16 


■0.L5 


656 


656 


0 


67 


47.7 


657.25 


657.10 


■0.15 


657 


657 


0 


68 


50.2 


658.19 


658.03 


■0.15 


658 


658 


0 




E:\DXA\DRIFT\DIUFT2.NEW 

20 





UNROUND 


ED 


REPORTED 


H Raw 


Vo 


yJilginal 


fNCW 


IveSluUal 






DifT 


\ Score 


Below 


Scale 


Scale 




Scale 


Scale 




69 


52.6 


659.12 


658.97 


-0.15 


659 


659 


0 


70 


55.2 


660.06 


659.91 


-0.15 


660 


660 


0 


71 


58.1 


661.00 


660.84 


-0.16 


661 


661 


0 


72 


60.9 


661.94 


661.78 


•0.16 


662 


662 


0 


1 73 


63.5 


662.87 


662.71 


-0.16 


663 


663 


0 


74 


66.0 


663.81 


663.65 


-0.16 


664 


664 


0 


75 


69.3 


664.75 


664.59 


-0.16 


665 


665 


0 


76 


72.2 


665.69 


665i2 


•^.16 


666 


666 


0 


77 


74.8 


666.63 


666.46 


-0.17 


667 


666 


-1 


78 


77.6 


667.56 


667.40 


-0.17 


668 


667 


-1 


1 79 


80.3 


668.50 


668.33 


-0.17 


669 


668 


-1 


1 80 


82.7 


669.44 


669.27 


-0.17 


669 


669 


0 


1 81 


852 


670.38 


670.21 


-0.17 


670 


670 


0 


82 


87.3 


671.32 


671.14 


-0.17 


671 


671 


0 


83 


89.5 


672.25 


672.08 


-0.18 


672 


672 


0 


84 


91.3 


673.19 


673.01 


-0.18 


673 


673 


0 


85 


92.9 


674.13 


673.95 


-0.18 


674 


674 


0 


86 


94.2 


675.07 


674.89 


-0.18 


675 


675 


0 


87 


95.5 


676.01 


675.82 


-0.18 


676 


676 


0 


88 


96.1 


676.94 


676.76 


-0.18 


677 


677 


0 


89 


96.9 


677.88 


677.70 


-0.19 


678 


678 


0 


90 


97.7 


678.82 


678.63 


-0.19 


679 


679 


0 


91 


98.6 


679.76 


679.57 


-0.19 


680 


680 


0 


92 


99.2 


680.69 


680.50 


-0.19. 


681 


681 


0 


93 


99.4 


681.63 


681.44 


-0.19 


682 


681 


-1 


94 


99.7 


682.57 


682.38 


-0.19 


683 


682 


-1 


95 


99.8 


683.51 


683.31 


-0.19 


684 


683 


-1 




99.9 


684.45 


684.25 


-0.20 


684 


684 


0 


97 


100.0 


685.38 


685.19 


-0.20 


685 


685 


0 


98 


100.0 


686.32 


686.12 


-0.20 


686 


686 


0 


99 


100.0 


687.26 


687.06 


-0.20 


687 


687 


0 


100 


100.0 


688.20 


688.1 ; 


-0.20 


688 


688 


0 


101 


100.0 


689.14 


688.93 


-0.20 


689 


689 


0 


102 


100.0 


690.07 


689.87 


-0.21 


690 


690 


0 


103 


100.0 


691.01 


690.80 


-0.21 


691 


691 


0 



E:\DXA\DRIFr\DRimNBW 




21 



Table 4. Scaled Score Statistics 
NTE Test of Professional Knowledge 





1988 Data 


1983 Data 


1 


K2-3INT 


3FNT4 


3FNT4 






Original 
Scale* 


New 
Scale' 


Original 
Scale' 


New 
Scale* 


Mean 


658.61 


658.35 


658.61 


656.88 


6.56.73 


Standard 
Deviation 


12.37 


12.39 


12.37 


12.78 


12.76 


Mean 
1 Difference 




0.26 


•0.15 



' Original conversion parameters (A =0.9363, B= 594.3650) are used with specified data. 
* New conversion parameters (A =0.9379, B= 594.4079) are used with specified data. 
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Table 5. Approximate' Standard Errors of Equating 
Test of Professional Knowledge 



ocaiea 
Score 


dtafiiiara 
Error of 
Equating 
NbIO 000 


dtanaara 
Error of 
Equating 
N» 20.000 




07191 


0 5085 


uuu 




04701 


O iU 


n ssfti 
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0 '^Q47 


ozu 










0 9d7Q 


O'tU 






ODU 


n 1701 


U. iZOD 


meBfi 


0 1517 


0,1073 


660 


0.1525 


0.1078 


670 


0.1971 


0.1394 


680 


0.2808 


0.1986 


690 


0.3786 


0.2677 


695 


0.4298 


0.3039 



' Formula is appropriate for the typical linear equating done for this test; it is not appropriate 
for Section Pre-Equating. The N-sizes used for the tabled values approximate those found in the 
data. 
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Figure 1. Equating Chain' for selected forms between 3ENT and K2-3INT 
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' The number of items common between forms is listed between the boxes. 
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Figure 2. Scaled Score Conversion Lines for 
Test of Professional Knowledge, form 3FNT4 
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Figure 3. Reported Scaled Score Conversion Lines for 
Test of Professional Knowledge, form 3FNT4 
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